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<223> 1374-2453 intron, 2709-4316 intron 
<400> 1 

gcttgagggc ctgcggttat gggacgttgc agtttgcgcc tactcaaaga tgccggtttt 60 

ataacggaga aaaatgggga gctattacga tctctccttg atgtggggtt tacaataaag 120 

cctggattgt tctacaagat tatgggggat gaaag atg ate etc gac act gac 173 

Met lie Leu Asp Thr Asp 
1 5 



tac ata ace gag gat gga aag cct gtc ata aga att ttc aag aag gaa 
Tyr lie Thr Glu Asp Gly Lys Pro Val lie Arg lie Phe Lys Lys Glu 
10 15 20 



221 



aac ggc gag ttt aag att gag tac gac egg act ttt gaa ccc tac ttc 
Asn Gly Glu Phe Lys lie Glu Tyr Asp Arg Thr Phe Glu Pro Tyr Phe 
25 30 35 



269 



tac gec etc ctg aag gac gat tct gee att gag gaa gtc aag aag ata 
Tyr Ala Leu Leu Lys Asp Asp Ser Ala lie Glu Glu Val Lys Lys lie 
40 45 50 



317 



ace gee gag agg cac ggg acg gtt gta acg gtt aag egg gtt gaa aag 
Thr Ala Glu Arg His Gly Thr Val Val Thr Val Lys Arg Val Glu Lys 
55 60 65 70 



365 



gtt cag aag aag ttc etc ggg aga cca gtt gag gtc tgg aaa etc tac 
Val Gin Lys Lys Phe Leu Gly Arg Pro Val Glu Val Trp Lys Leu Tyr 
75 80 85 



413 



ttt act cat ccg cag gac gtc cca gcg ata agg gac aag ata cga gag 
Phe Thr His Pro Gin Asp Val Pro Ala lie Arg Asp Lys lie Arg Glu 
90 95 100 



461 



cat ggg gca gtt att gac ate tac gag tac gac ata ccc ttc gee aag 
His Gly Ala Val lie Asp lie Tyr Glu Tyr Asp lie Pro Phe Ala Lys 
105 110 115 



509 



cgc tac etc ata gac aag gga tta gtg cca atg gaa ggc gac gag gag 557 



Arg Tyr Leu lie Asp Lys Gly Leu Val Pro Met Glu Gly Asp Glu Glu 
120 125 130 



ctg aaa atg etc gec ttc gac att gaa act etc tac cat gag ggc gag 605 
Leu Lys Met Leu Ala Phe Asp He Glu Thr Leu Tyr His Glu Gly Glu 
135 140 145 150 

gag ttc gee gag ggg cca ate ctt atg ata age tac gec gac gag gaa 653 
Glu Phe Ala Glu Gly Pro He Leu Met He Ser Tyr Ala Asp Glu Glu 
155 160 165 

ggg gec agg gtg ata act tgg aag aac gtg gat etc ccc tac gtt gac 701 
Gly Ala Arg Val He Thr Trp Lys Asn Val Asp Leu Pro Tyr Val Asp 
170 175 180 

gtc gtc teg acg gag agg gag atg ata aag cgc ttc etc cgt gtt gtg 74 9 
Val Val Ser Thr Glu Arg Glu Met He Lys Arg Phe Leu Arg Val Val 
185 190 195 

aag gag aaa gac ccg gac gtt etc ata ace tac aac ggc gac aac ttc 797 
Lys Glu Lys Asp Pro Asp Val Leu He Thr Tyr Asn Gly Asp Asn Phe 
200 205 210 

gac ttc gee tat ctg aaa aag cgc tgt gaa aag etc gga ata aac ttc 845 
Asp Phe Ala Tyr Leu Lys Lys Arg Cys Glu Lys Leu Gly He Asn Phe 
215 220 225 230 

gee etc gga agg gat gga age gag ccg aag att cag agg atg ggc gac 8 93 
Ala Leu Gly Arg Asp Gly Ser Glu Pro Lys He Gin Arg Met Gly Asp 
235 240 245 

agg ttt gec gtc gaa gtg aag gga egg ata cac ttc gat etc tat cct 941 
Arg Phe Ala Val Glu Val Lys Gly Arg He His Phe Asp Leu Tyr Pro 
250 255 260 

gtg ata aga egg acg ata aac ctg ccc aca tac acg ctt gag gee gtt 989 
Val He Arg Arg Thr He Asn Leu Pro Thr Tyr Thr Leu Glu Ala Val 
265 270 275 

tat gaa gee gtc ttc ggt cag ccg aag gag aag gtt tac get gag gaa 1037 
Tyr Glu Ala Val Phe Gly Gin Pro Lys Glu Lys Val Tyr Ala Glu Glu 
280 285 290 

ata acc aca gec tgg gaa acc ggc gag aac ctt gag aga gtc gec cgc 1085 
He Thr Thr Ala Trp Glu Thr Gly Glu Asn Leu Glu Arg Val Ala Arg 
295 300 305 310 

tac teg atg gaa gat gcg aag gtc aca tac gag ctt ggg aag gag ttc 1133 
Tyr Ser Met Glu Asp Ala Lys Val Thr Tyr Glu Leu Gly Lys Glu Phe 
315 320 325 

ctt ccg atg gag gee cag ctt tct cgc tta ate ggc cag tec etc tgg 1181 
Leu Pro Met Glu Ala Gin Leu Ser Arg Leu He Gly Gin Ser Leu Trp 
330 335 340 

gac gtc tec cgc tec age act ggc aac etc gtt gag tgg ttc etc etc 1229 
Asp Val Ser Arg Ser Ser Thr Gly Asn Leu Val Glu Trp Phe Leu Leu 



345 



350 



355 



agg aag gcc 
Arg Lys Ala 
360 

aag gag ctg 
Lys Glu Leu 
375 

gag ccc gag 
Glu Pro Glu 



tgc cat cca 
Cys His Pro 



aac ate age 
Asn lie Ser 
425 

tgg cag aga 
Trp Gin Arg 
440 

gta aac ata 
Val Asn He 
455 

gtt aca aag 
Val Thr Lys 



tct ttc ctt 
Ser Phe Leu 



ttc tat gaa 
Phe Tyr Glu 
505 

gtt etc aag 
Val Leu Lys 
520 

ttg agg aaa 
Leu Arg Lys 
535 

att tea cac 
He Ser His 



gag ttt agg 
Glu Phe Arg 



tat gag agg 
Tyr Glu Arg 



gcc aga aga 
Ala Arg Arg 
380 

aga ggg ttg 
Arg Gly Leu 
395 

gcc gat acg 
Ala Asp Thr 
410 

gag gtt cag 
Glu Val Gin 



gtt aga aaa 
Val Arg Lys 



aac ggg tta 
Asn Gly Leu 
460 

aac gaa cga 
Asn Glu Arg 
475 

act aaa aaa 
Thr Lys Lys 
490 

ata ggc aga 
He Gly Arg 



gga gag etc 
Gly Glu Leu 



gac gtt gaa 
Asp Val Glu 
540 

cag tat cgt 
Gin Tyr Arg 
555 

gat cgt ate 
Asp Arg He 
570 



aat gag ctg 
Asn Glu Leu 
365 

egg cag age 
Arg Gin Ser 



tgg gag aac 
Trp Glu Asn 



aag gtt gtc 
Lys Val Val 
415 

gaa ggt gac 
Glu Gly Asp 
430 

gta tgg gaa 
Val Trp Glu 
445 

aag tgt acg 
Lys Cys Thr 



caa acg aga 
Gin Thr Arg 



gtt aag ggc 
Val Lys Gly 
495 

gcg aca agt 
Ala Thr Ser 
510 

get ggc ata 
Ala Gly He 
525 

tac ttt gat 
Tyr Phe Asp 



gtt gag ata 
Val Glu He 



aca tac att 
Thr Tyr He 
575 



gcc ccg aac 
Ala Pro Asn 
370 

tat gaa gga 
Tyr Glu Gly 
385 

ata gtg tac 
He Val Tyr 
400 

gtc aag ggg 
Val Lys Gly 



tat gtc ctt 
Tyr Val Leu 



tac gac tac 
Tyr Asp Tyr 
450 

ccc aat cat 
Pro Asn His 
465 

ata aga gac 
He Arg Asp 
480 

aag ata ata 
Lys He He 



gag aat att 
Glu Asn He 



eta ttg get 
Leu Leu Ala 
530 

tea tec cgc 
Ser Ser Arg 
545 

acc att ggg 
Thr He Gly 
560 

ttt gag cgt 
Phe Glu Arg 



aag ccc gat 
Lys Pro Asp 



ggc tat gta 
Gly Tyr Val 



eta gat ttt 
Leu Asp Phe 
405 

aag ggg att 
Lys Gly He 
420 

ggg att gac 
Gly He Asp 
435 

aaa ggg gag 
Lys Gly Glu 



aag ctt ccc 
Lys Leu Pro 



agt ctt get 
Ser Leu Ala 
485 

acc act ccc 
Thr Thr Pro 
500 

cca gaa gaa 
Pro Glu Glu 
515 

gaa gga acg 
Glu Gly Thr 



aaa aaa egg 
Lys Lys Arg 



aaa gac gag 
Lys Asp Glu 
565 

ttg ttt ggg 
Leu Phe Gly 
580 



gaa 1277 
Glu 



aaa 1325 

Lys 

390 

aga 1373 
Arg 



ata 1421 
He 



ggc 1469 
Gly 



ctt 1517 
Leu 



gtt 1565 

Val 

470 

aag 1613 
Lys 



ctt 1661 
Leu 



gag 1709 
Glu 



etc 1757 
Leu 



agg 1805 

Arg 

550 

gag 1853 
Glu 



att 1901 
He 



act cca age 
Thr Pro Ser 
585 

gtt gcg aag 
Val Ala Lys 
600 

ata gag tec 
He Glu Ser 
615 

gac ggt tea 
Asp Gly Ser 



aca aag aac 
Thr Lys Asn 



ctt ggt ate 
Leu Gly He 
665 

aaa gat egg 
Lys Asp Arg 
680 

ata ctg ttc 
He Leu Phe 
695 

ctg ctt aat 
Leu Leu Asn 



aat gga ttt 
Asn Gly Phe 



gag ggc aag 
Glu Gly Lys 
745 

gec aat ggc 
Ala Asn Gly 
760 

acc cac aac 
Thr His Asn 
775 

tat gac gtt 
Tyr Asp Val 



ate teg gag 
He Ser Glu 



aag aat gtt 
Lys Asn Val 



eta cat gec 
Leu His Ala 
620 

gta aac agg 
Val Asn Arg 
635 

gag tgg aag 
Glu Trp Lys 
650 

cct cat caa 
Pro His Gin 



age agg tat 
Ser Arg Tyr 



caa aca etc 
Gin Thr Leu 
700 

aag gca ata 
Lys Ala He 
715 

tac agg etc 
Tyr Arg Leu 
730 

gtc tat gac 
Val Tyr Asp 



ata ttg acc 
He Leu Thr 



gtc teg ccg 
Val Ser Pro 
780 

gec cca cag 
Ala Pro Gin 
795 



aag aaa gga 
Lys Lys Gly 
590 

tat ctt aaa 
Tyr Leu Lys 
605 

ccc teg gtt 
Pro Ser Val 



gtt agg agg 
Val Arg Arg 



att aaa ctg 
He Lys Leu 
655 

acg tac acg 
Thr Tyr Thr 
670 

ata ctg gag 
lie Leu Glu 
685 

att gga ttc 
He Gly Phe 



tct cag agg 
Ser Gin Arg 



agt gaa ttc 
Ser Glu Phe 
735 

tta act ctt 
Leu Thr Leu 
750 

cat aac tec 
His Asn Ser 
765 

gat acg etc 
Asp Thr Leu 



gtc ggc cac 
Val Gly His 



act aac gca 
Thr Asn Ala 



gtc aag gaa 
Val Lys Glu 
610 

etc agg gga 
Leu Arg Gly 
625 

agt att gtt 
Ser He Val 
640 

gtg tea aaa 
Val Ser Lys 



tat cag tat 
Tyr Gin Tyr 



ata act gga 
He Thr Gly 
690 

ate agt gaa 
He Ser Glu 
705 

gaa atg aac 
Glu Met Asn 
720 

aat gtc age 
Asn Val Ser 



gaa gga act 
Glu Gly Thr 



ctg tac ccc 
Leu Tyr Pro 
770 

aac aga gaa 
Asn Arg Glu 
785 

cgc ttc tgc 
Arg Phe Cys 
800 



gta aca etc 
Val Thr Leu 
595 

att atg gac 
He Met Asp 



ttc ttc gaa 
Phe Phe Glu 



gca acc cag 
Ala Thr Gin 
645 

ctg etc tec 
Leu Leu Ser 
660 

cag gaa aat 
Gin Glu Asn 
675 

aag gac gga 
Lys Asp Gly 



aga aag aac 
Arg Lys Asn 



aac ttg gaa 
Asn Leu Glu 
725 

acg gaa tac 
Thr Glu Tyr 
740 

ccc tac tac 
Pro Tyr Tyr 
755 

tea ate ate 
Ser He He 



gga tgc aag 
Gly Cys Lys 



aag gac ttc 
Lys Asp Phe 
805 



aaa 1949 
Lys 



aac 1997 
Asn 



ggc 2045 

Gly 

630 

ggt 2093 
Gly 



cag 2141 
Gin 



ggg 218 9 
Gly 



ttg 2237 
Leu 



get 2285 

Ala 

710 

aac 2333 
Asn 



tat 2381 
Tyr 



ttt 2429 
Phe 



ate 2477 
He 



gaa 2525 

Glu 

790 

cca 2573 
Pro 



gga ttt ate ccg age ctg ctt gga gac etc eta gag gag agg cag aag 2621 
Gly Phe lie Pro Ser Leu Leu Gly Asp Leu Leu Glu Glu Arg Gin Lys 
810 815 820 

ata aag aag aag atg aag gee acg att gac ccg ate gag agg aag etc 2669 
lie Lys Lys Lys Met Lys Ala Thr lie Asp Pro lie Glu Arg Lys Leu 
825 830 835 

etc gat tac agg cag agg gee ate aag ate ctg gca aac age ate eta 2717 
Leu Asp Tyr Arg Gin Arg Ala lie Lys lie Leu Ala Asn Ser lie Leu 
840 845 850 

ccc gag gaa tgg ctt cca gtc etc gag gaa ggg gag gtt cac ttc gtc 2765 
Pro Glu Glu Trp Leu Pro Val Leu Glu Glu Gly Glu Val His Phe Val 
855 860 865 870 

agg att gga gag etc ata gac egg atg atg gag gaa aat get ggg aaa 2813 
Arg lie Gly Glu Leu lie Asp Arg Met Met Glu Glu Asn Ala Gly Lys 
875 880 885 

gta aag aga gag ggc gag acg gaa gtg ctt gag gtc agt ggg ctt gaa 2 861 
Val Lys Arg Glu Gly Glu Thr Glu Val Leu Glu Val Ser Gly Leu Glu 
890 895 900 

gtc ccg tec ttt aac agg aga act aac aag gec gag etc aag aga gta 2 909 
Val Pro Ser Phe Asn Arg Arg Thr Asn Lys Ala Glu Leu Lys Arg Val 
905 910 915 

aag gee ctg att agg cac gat tat tct ggc aag gtc tac acc ate aga 2 957 
Lys Ala Leu lie Arg His Asp Tyr Ser Gly Lys Val Tyr Thr lie Arg 
920 925 930 

ctg aag teg ggg agg aga ata aag ata acc tct ggc cac age etc ttc 3005 
Leu Lys Ser Gly Arg Arg lie Lys lie Thr Ser Gly His Ser Leu Phe 
935 940 945 950 

tct gtg aga aac ggg gag etc gtt gaa gtt acg ggc gat gaa eta aag 3053 
Ser Val Arg Asn Gly Glu Leu Val Glu Val Thr Gly Asp Glu Leu Lys 
955 960 965 

cca ggt gac etc gtt gca gtc ccg egg aga ttg gag ctt cct gag aga 3101 
Pro Gly Asp Leu Val Ala Val Pro Arg Arg Leu Glu Leu Pro Glu Arg 
970 975 980 

aac cac gtg ctg aac etc gtt gaa ctg etc ctt gga acg cca gaa gaa 314 9 
Asn His Val Leu Asn Leu Val Glu Leu Leu Leu Gly Thr Pro Glu Glu 
985 990 995 

gaa act ttg gac ate gtc atg acg ate cca gtc aag ggt aag aag aac 3197 
Glu Thr Leu Asp lie Val Met Thr lie Pro Val Lys Gly Lys Lys Asn 
1000 1005 1010 

ttc ttt aaa ggg atg etc agg act ttg cgc tgg att ttc gga gag gaa 3245 
Phe Phe Lys Gly Met Leu Arg Thr Leu Arg Trp lie Phe Gly Glu Glu 
1015 1020 1025 1030 

aag agg ccc aga acc gcg aga cgc tat etc agg cac ctt gag gat ctg 3293 



Lys Arg Pro Arg Thr Ala Arg Arg Tyr Leu Arg His Leu Glu Asp Leu 
1035 1040 1045 



ggc tat gtc egg ctt aag aag ate ggc tac gaa gtc etc gac tgg gac 3341 
Gly Tyr Val Arg Leu Lys Lys lie Gly Tyr Glu Val Leu Asp Trp Asp 
1050 1055 1060 

tea ctt aag aac tac aga agg etc tac gag gcg ctt gtc gag aac gtc 3389 
Ser Leu Lys Asn Tyr Arg Arg Leu Tyr Glu Ala Leu Val Glu Asn Val 
1065 1070 1075 

aga tac aac ggc aac aag agg gag tac etc gtt gaa ttc aat tec ate 3437 
Arg Tyr Asn Gly Asn Lys Arg Glu Tyr Leu Val Glu Phe Asn Ser lie 
1080 1085 1090 

egg gat gca gtt ggc ata atg ccc eta aaa gag ctg aag gag tgg aag 3485 
Arg Asp Ala Val Gly lie Met Pro Leu Lys Glu Leu Lys Glu Trp Lys 
1095 1100 1105 1110 

ate ggc acg ctg aac ggc ttc aga atg aga aag etc att gaa gtg gac 3533 
lie Gly Thr Leu Asn Gly Phe Arg Met Arg Lys Leu lie Glu Val Asp 
1115 1120 1125 

gag teg tta gca aag etc etc ggc tac tac gtg age gag ggc tat gca 3581 
Glu Ser Leu Ala Lys Leu Leu Gly Tyr Tyr Val Ser Glu Gly Tyr Ala 
1130 1135 1140 

aga aag cag agg aat ccc aaa aac ggc tgg age tac age gtg aag etc 3629 
Arg Lys Gin Arg Asn Pro Lys Asn Gly Trp Ser Tyr Ser Val Lys Leu 
1145 1150 1155 

tac aac gaa gac cct gaa gtg ctg gac gat atg gag aga etc gee age 3677 
Tyr Asn Glu Asp Pro Glu Val Leu Asp Asp Met Glu Arg Leu Ala Ser 
1160 1165 1170 

agg ttt ttc ggg aag gtg agg egg ggc agg aac tac gtt gag ata ccg 3725 
Arg Phe Phe Gly Lys Val Arg Arg Gly Arg Asn Tyr Val Glu lie Pro 
1175 1180 1185 1190 

aag aag ate ggc tac ctg etc ttt gag aac atg tgc ggt gtc eta gcg 3773 
Lys Lys lie Gly Tyr Leu Leu Phe Glu Asn Met Cys Gly Val Leu Ala 
1195 1200 1205 

gag aac aag agg att ccc gag ttc gtc ttc acg tec ccg aaa ggg gtt 3821 
Glu Asn Lys Arg lie Pro Glu Phe Val Phe Thr Ser Pro Lys Gly Val 
1210 1215 1220 

egg ctg gec ttc ctt gag ggg tac tea teg gcg atg gcg acg tec ace 3869 
Arg Leu Ala Phe Leu Glu Gly Tyr Ser Ser Ala Met Ala Thr Ser Thr 
1225 1230 1235 

gaa caa gag act cag get etc aac gaa aag cga get tta gcg aac cag 3 917 
Glu Gin Glu Thr Gin Ala Leu Asn Glu Lys Arg Ala Leu Ala Asn Gin 
1240 1245 1250 

etc gtc etc etc ttg aac teg gtg ggg gtc tct get gta aaa ctt ggg 3965 
Leu Val Leu Leu Leu Asn Ser Val Gly Val Ser Ala Val Lys Leu Gly 



1255 



1260 



1265 



1270 



cac gac age ggc gtt tac agg gtc tat ata aac gag gag etc ccg ttc 4013 
His Asp Ser Gly Val Tyr Arg Val Tyr lie Asn Glu Glu Leu Pro Phe 
1275 1280 1285 

gta aag ctg gac aag aaa aag aac gec tac tac tea cac gtg ate ccc 4061 
Val Lys Leu Asp Lys Lys Lys Asn Ala Tyr Tyr Ser His Val lie Pro 
1290 1295 1300 

aag gaa gtc ctg age gag gtc ttt ggg aag gtt ttc cag aaa aac gtc 4109 
Lys Glu Val Leu Ser Glu Val Phe Gly Lys Val Phe Gin Lys Asn Val 
1305 1310 1315 

agt cct cag acc ttc agg aag atg gtc gag gac gga aga etc gat ccc 4157 
Ser Pro Gin Thr Phe Arg Lys Met Val Glu Asp Gly Arg Leu Asp Pro 
1320 1325 1330 

gaa aag gec cag agg etc tec tgg etc att gag ggg gac gta gtg etc 4205 
Glu Lys Ala Gin Arg Leu Ser Trp Leu lie Glu Gly Asp Val Val Leu 
1335 1340 1345 1350 

gac cgc gtt gag tec gtt gat gtg gaa gac tac gat ggt tat gtc tat 4253 
Asp Arg Val Glu Ser Val Asp Val Glu Asp Tyr Asp Gly Tyr Val Tyr 
1355 1360 1365 

gac ctg age gtc gag gac aac gag aac ttc etc gtt ggc ttt ggg ttg 43 01 
Asp Leu Ser Val Glu Asp Asn Glu Asn Phe Leu Val Gly Phe Gly Leu 
1370 1375 1380 

gtc tat get cac aac age tac tac ggt tac tac ggc tat gca agg gcg 4349 
Val Tyr Ala His Asn Ser Tyr Tyr Gly Tyr Tyr Gly Tyr Ala Arg Ala 
1385 1390 1395 

cgc tgg tac tgc aag gag tgt gca gag age gta acg gec tgg gga agg 4 3 97 
Arg Trp Tyr Cys Lys Glu Cys Ala Glu Ser Val Thr Ala Trp Gly Arg 
1400 1405 1410 

gag tac ata acg atg acc ate aag gag ata gag gaa aag tac ggc ttt 4445 
Glu Tyr lie Thr Met Thr lie Lys Glu lie Glu Glu Lys Tyr Gly Phe 
1415 1420 1425 1430 

aag gta ate tac age gac acc gac gga ttt ttt gee aca ata cct gga 4493 
Lys Val lie Tyr Ser Asp Thr Asp Gly Phe Phe Ala Thr lie Pro Gly 
1435 1440 1445 

gec gat get gaa acc gtc aaa aag aag get atg gag ttc etc aag tat 4541 
Ala Asp Ala Glu Thr Val Lys Lys Lys Ala Met Glu Phe Leu Lys Tyr 
1450 1455 1460 

ate aac gee aaa ctt ccg ggc gcg ctt gag etc gag tac gag ggc ttc 4589 
lie Asn Ala Lys Leu Pro Gly Ala Leu Glu Leu Glu Tyr Glu Gly Phe 
1465 1470 1475 

tac aaa cgc ggc ttc ttc gtc acg aag aag aag tat gcg gtg ata gac 4637 
Tyr Lys Arg Gly Phe Phe Val Thr Lys Lys Lys Tyr Ala Val lie Asp 
1480 1485 1490 



gag gaa ggc aag ata aca acg cgc gga ctt gag att gtg agg cgt gac 4685 
Glu Glu Gly Lys He Thr Thr Arg Gly Leu Glu He Val Arg Arg Asp 
1495 1500 1505 1510 

tgg age gag ata gcg aaa gag acg cag gcg agg gtt ctt gaa get ttg 4733 
Trp Ser Glu He Ala Lys Glu Thr Gin Ala Arg Val Leu Glu Ala Leu 
1515 1520 1525 

eta aag gac ggt gac gtc gag aag gee gtg agg ata gtc aaa gaa gtt 4781 
Leu Lys Asp Gly Asp Val Glu Lys Ala Val Arg He Val Lys Glu Val 
1530 1535 1540 

ace gaa aag ctg age aag tac gag gtt ccg ccg gag aag ctg gtg ate 4829 
Thr Glu Lys Leu Ser Lys Tyr Glu Val Pro Pro Glu Lys Leu Val He 
1545 1550 1555 

cac gag cag ata acg agg gat tta aag gac tac aag gca acc ggt ccc 4877 
His Glu Gin He Thr Arg Asp Leu Lys Asp Tyr Lys Ala Thr Gly Pro 
1560 1565 1570 

cac gtt gec gtt gee aag agg ttg gee gcg aga gga gtc aaa ata cgc 4 925 
His Val Ala Val Ala Lys Arg Leu Ala Ala Arg Gly Val Lys He Arg 
1575 1580 1585 1590 

cct gga acg gtg ata age tac ate gtg etc aag ggc tct ggg agg ata 4973 
Pro Gly Thr Val He Ser Tyr He Val Leu Lys Gly Ser Gly Arg He 
1595 1600 1605 

ggc gac agg gcg ata ccg ttc gac gag ttc gac ccg acg aag cac aag 5021 
Gly Asp Arg Ala He Pro Phe Asp Glu Phe Asp Pro Thr Lys His Lys 
1610 1615 1620 

tac gac gee gag tac tac att gag aac cag gtt etc cca gee gtt gag 5069 
Tyr Asp Ala Glu Tyr Tyr He Glu Asn Gin Val Leu Pro Ala Val Glu 
1625 1630 1635 



aga att ctg aga gee ttc ggt tac cgc 
Arg lie Leu Arg Ala Phe Gly Tyr Arg 
1640 1645 

aag acg aga cag gtt ggt ttg agt get 
Lys Thr Arg Gin Val Gly Leu Ser Ala 
1655 1660 



aag gaa gac ctg cgc tac cag 5117 
Lys Glu Asp Leu Arg Tyr Gin 
1650 

tgg ctg aag ccg aag gga act 5165 
Trp Leu Lys Pro Lys Gly Thr 
1665 1670 



tgacctttcc atttgttttc cageggataa ccctttaact tccctttcaa aaactccctt 5225 
tagggaaaga ccatgaagat agaaatcegg cggcgcccgg ttaaataege taggatagaa 52 85 
gtgaagccag aeggcagggt agtegtcact gccccgaggg ttcaacgttg agaagtt 5342 



<210> 2 

<211> 774 

<212> PRT 

<213> Pyrococcus kodakaraensis 



<400> 2 

Met lie Leu Asp Thr Asp Tyr lie Thr Glu Asp Gly Lys Pro Val lie 
15 10 15 



Arg lie Phe Lys Lys Glu Asn Gly Glu Phe Lys lie Glu Tyr Asp Arg 
20 25 30 

Thr Phe Glu Pro Tyr Phe Tyr Ala Leu Leu Lys Asp Asp Ser Ala lie 
35 40 45 

Glu Glu Val Lys Lys He Thr Ala Glu Arg His Gly Thr Val Val Thr 
50 55 60 

Val Lys Arg Val Glu Lys Val Gin Lys Lys Phe Leu Gly Arg Pro Val 
65 70 75 80 

Glu Val Trp Lys Leu Tyr Phe Thr His Pro Gin Asp Val Pro Ala He 
85 90 95 

Arg Asp Lys He Arg Glu His Pro Ala Val He Asp He Tyr Glu Tyr 
100 105 110 

Asp He Pro Phe Ala Lys Arg Tyr Leu He Asp Lys Gly Leu Val Pro 
115 120 125 

Met Glu Gly Asp Glu Glu Leu Lys Met Leu Ala Phe Asp He Glu Thr 
130 135 140 

Leu Tyr His Glu Gly Glu Glu Phe Ala Glu Gly Pro He Leu Met He 
145 150 155 160 

Ser Tyr Ala Asp Glu Glu Gly Ala Arg Val He Thr Trp Lys Asn Val 
165 170 175 

Asp Leu Pro Tyr Val Asp Val Val Ser Thr Glu Arg Glu Met He Lys 
180 185 190 

Arg Phe Leu Arg Val Val Lys Glu Lys Asp Pro Asp Val Leu He Thr 
195 200 205 

Tyr Asn Gly Asp Asn Phe Asp Phe Ala Tyr Leu Lys Lys Arg Cys Glu 
210 215 220 

Lys Leu Gly He Asn Phe Ala Leu Gly Arg Asp Gly Ser Glu Pro Lys 
225 230 235 240 

He Gin Arg Met Gly Asp Arg Phe Ala Val Glu Val Lys Gly Arg He 
245 250 255 

His Phe Asp Leu Tyr Pro Val He Arg Arg Thr He Asn Leu Pro Thr 
260 265 270 

Tyr Thr Leu Glu Ala Val Tyr Glu Ala Val Phe Gly Gin Pro Lys Glu 
275 280 285 



Lys Val Tyr Ala Glu Glu He Thr Thr Ala Trp Glu Thr Gly Glu Asn 
290 295 300 



Leu Glu Arg Val Ala Arg Tyr Ser Met Glu Asp Ala Lys Val Thr Tyr 
305 310 315 320 



Glu Leu Gly Lys Glu Phe Leu Pro Met Glu Ala Gin Leu Ser Arg Leu 
325 330 • 335 

lie Gly Gin Ser Leu Trp Asp Val Ser Arg Ser Ser Thr Gly Asn Leu 
340 345 350 

Val Glu Trp Phe Leu Leu Arg Lys Ala Tyr Glu Arg Asn Glu Leu Ala 
355 360 365 

Pro Asn Lys Pro Asp Glu Lys Glu Leu Ala Arg Arg Arg Gin Ser Tyr 
370 375 380 

Glu Gly Gly Tyr Val Lys Glu Pro Glu Arg Gly Leu Trp Glu Asn lie 
385 390 395 400 

Val Tyr Leu Asp Phe Arg Ser Leu Tyr Pro Ser lie lie lie Thr His 
405 410 415 

Asn Val Ser Pro Asp Thr Leu Asn Arg Glu Gly Cys Lys Glu Tyr Asp 
420 425 430 

Val Ala Pro Gin Val Gly His Arg Phe Cys Lys Asp Phe Pro Gly Phe 
435 440 445 

lie Pro Ser Leu Leu Gly Asp Leu Leu Glu Glu Arg Gin Lys lie Lys 
450 455 460 

Lys Lys Met Lys Ala Thr lie Asp Pro lie Glu Arg Lys Leu Leu Asp 
465 470 475 480 

Tyr Arg Gin Arg Ala lie Lys lie Leu Ala Asn Ser Tyr Tyr Gly Tyr 
485 490 495 

Tyr Gly Tyr Ala Arg Ala Arg Trp Tyr Cys Lys Glu Cys Ala Glu Ser 
500 505 510 

Val Thr Ala Trp Gly Arg Glu Tyr lie Thr Met Thr lie Lys Glu lie 
515 520 525 

Glu Glu Lys Tyr Gly Phe Lys Val lie Tyr Ser Asp Thr Asp Gly Phe 
530 535 540 

Phe Ala Thr lie Pro Gly Ala Asp Ala Glu Thr Val Lys Lys Lys Ala 
545 550 555 560 

Met Glu Phe Leu Lys Tyr lie Asn Ala Lys Leu Pro Gly Ala Leu Glu 
565 570 575 



Leu Glu Tyr Glu Gly Phe Tyr Lys Arg Gly Phe Phe Val Thr Lys Lys 
580 585 590 



Lys Tyr Ala Val lie Asp Glu Glu Gly Lys He Thr Thr Arg Gly Leu 
595 600 605 



Glu lie Val Arg Arg Asp Trp Ser Glu lie Ala Lys Glu Thr Gin Ala 
610 615 620 



Arg Val Leu Glu 
625 

Arg lie Val Lys 



Pro Glu Lys Leu 
660 

Tyr Lys Ala Thr 
675 

Arg Gly Val Lys 
690 

Lys Gly Ser Gly 
705 

Asp Pro Thr Lys 



Val Leu Pro Ala 
740 

Glu Asp Leu Arg 
755 

Leu Lys Pro Lys 
770 



Ala Leu Leu Lys 
630 

Glu Val Thr Glu 
645 

Val He His Glu 



Gly Pro His Val 
680 

He Arg Pro Gly 
695 

Arg He Gly Asp 
710 

His Lys Tyr Asp 
725 

Val Glu Arg He 



Tyr Gin Lys Thr 
760 

Gly Thr 



Asp Gly Asp Val 
635 

Lys Leu Ser Lys 
650 

Gin He Thr Arg 
665 

Ala Val Ala Lys 



Thr Val He Ser 
700 

Arg Ala He Pro 
715 

Ala Glu Tyr Tyr 
730 

Leu Arg Ala Phe 
745 

Arg Gin Val Gly 



Glu Lys Ala Val 
640 

Tyr Glu Val Pro 
655 

Asp Leu Lys Asp 
670 

Arg Leu Ala Ala 
685 

Tyr lie Val Leu 



Phe Asp Glu Phe 
720 

He Glu Asn Gin 
735 

Gly Tyr Arg Lys 
750 

Leu Ser Ala Trp 
765 



<210> 3 
<211> 2325 
<212> DNA 

<213> Pyrococcus kodakaraensis 
<400> 3 

atgatcctcg acactgacta cataaccgag gatggaaagc ctgtcataag aattttcaag 60 
aaggaaaacg gcgagtttaa gattgagtac gaccggactt ttgaacccta cttctacgcc 12 0 
ctcctgaagg acgattctgc cattgaggaa gtcaagaaga taaccgccga gaggcacggg 180 
acggttgtaa cggttaagcg ggttgaaaag gttcagaaga agttcctcgg gagaccagtt 24 0 
gaggtctgga aactctactt tactcatccg caggacgtcc cagcgataag ggacaagata 300 
cgagagcatc cagcagttat tgacatctac gagtacgaca tacccttcgc caagcgctac 360 
ctcatagaca agggattagt gccaatggaa ggcgacgagg agctgaaaat gctcgccttc 420 
gacattgaaa ctctctacca tgagggcgag gagttcgccg aggggccaat ccttatgata 4 80 
agctacgccg acgaggaagg ggccagggtg ataacttgga agaacgtgga tctcccctac 540 
gttgacgtcg tctcgacgga gagggagatg ataaagcgct tcctccgtgt tgtgaaggag 600 
aaagacccgg acgttctcat aacctacaac ggcgacaact tcgacttcgc ctatctgaaa 660 
aagcgctgtg aaaagctcgg aataaacttc gccctcggaa gggatggaag cgagccgaag 720 
attcagagga tgggcgacag gtttgccgtc gaagtgaagg gacggataca cttcgatctc 780 
tatcctgtga taagacggac gataaacctg cccacataca cgcttgaggc cgtttatgaa 840 
gccgtcttcg gtcagccgaa ggagaaggtt tacgctgagg aaataaccac agcctgggaa 900 
accggcgaga accttgagag agtcgcccgc tactcgatgg aagatgcgaa ggtcacatac 960 



gagcttggga aggagttcct tccgatggag gcccagcttt ctcgcttaat cggccagtcc 1020 

ctctgggacg tctcccgctc cagcactggc aacctcgttg agtggttcct cctcaggaag 1080 

gcctatgaga ggaatgagct ggccccgaac aagcccgatg aaaaggagct ggccagaaga 1140 

cggcagagct atgaaggagg ctatgtaaaa gagcccgaga gagggttgtg ggagaacata 1200 

gtgtacctag attttagatc cctgtacccc tcaatcatca tcacccacaa cgtctcgccg 1260 

gatacgctca acagagaagg atgcaaggaa tatgacgttg ccccacaggt cggccaccgc 1320 

ttctgcaagg acttcccagg atttatcccg agcctgcttg gagacctcct agaggagagg 1380 

cagaagataa agaagaagat gaaggccacg attgacccga tcgagaggaa gctcctcgat 1440 

tacaggcaga gggccatcaa gatcctggca aacagctact acggttacta cggctatgca 1500 

agggcgcgct ggtactgcaa ggagtgtgca gagagcgtaa cggcctgggg aagggagtac 1560 

ataacgatga ccatcaagga gatagaggaa aagtacggct ttaaggtaat ctacagcgac 1620 

accgacggat tttttgccac aatacctgga gccgatgctg aaaccgtcaa aaagaaggct 1680 

atggagttcc tcaagtatat caacgccaaa cttccgggcg cgcttgagct cgagtacgag 1740 

ggcttctaca aacgcggctt cttcgtcacg aagaagaagt atgcggtgat agacgaggaa 1800 

ggcaagataa caacgcgcgg acttgagatt gtgaggcgtg actggagcga gatagcgaaa 1860 

gagacgcagg cgagggttct tgaagctttg ctaaaggacg gtgacgtcga gaaggccgtg 1920 

aggatagtca aagaagttac cgaaaagctg agcaagtacg aggttccgcc ggagaagctg 1980 

gtgatccacg agcagataac gagggattta aaggactaca aggcaaccgg tccccacgtt 2 040 

gccgttgcca agaggttggc cgcgagagga gtcaaaatac gccctggaac ggtgataagc 2100 

tacatcgtgc tcaagggctc tgggaggata ggcgacaggg cgataccgtt cgacgagttc 2160 

gacccgacga agcacaagta cgacgccgag tactacattg agaaccaggt tctcccagcc 2220 

gttgagagaa ttctgagagc cttcggttac cgcaaggaag acctgcgcta ccagaagacg 2280 

agacaggttg gtttgagtgc ttggctgaag ccgaagggaa cttga 2325 



<210> 4 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 4 

gaaactctct acgaggaggg cgaggagttc gccg 34 



<210> 5 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 

<400> 5 

cggcgaactc ctcgccctcc tcgtagagag tttc 34 



<210> 6 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 



<400> 6 

gaaactctct acgacgaggg cgaggagttc gccg 



<210> 7 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 

<400> 7 

cggcgaactc ctcgccctcg tcgtagagag tttc 



<210> 8 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 

<400> 8 

gaaactctct actacgaggg cgaggagttc gccg 



<210> 9 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 

<400> 9 

cggcgaactc ctcgccctcg tagtagagag tttc 



<210> 10 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 10 

gaaactctct acgccgaggg cgaggagttc gc 

<210> 11 
<211> 32 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : primer 
<400> 11 

gcgaactcct cgccctcggc gtagagagtt tc 32 



<210> 12 

<211> 30 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : primer 
<400> 12 

gaaactctct acaaggaggg cgaggagttc 3 0 



<210> 13 

<211> 30 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : primer 
<400> 13 

gaactcctcg ccctccttgt agagagtttc 30 



<210> 14 
<211> 32 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : primer 
<400> 14 

aaagctctct acagggaggg cgaggagttc gc 3 2 



<210> 15 

<211> 32 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : primer 
<400> 15 

gcgaactcct cgccctccct gtagagagtt tc 32 



<210> 
<211> 



16 
34 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 16 

gaaactctct actctgaggg cgaggagttc gccg 

<210> 17 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 17 

cggcgaactc ctcgccctca gagtagagag tttc 

<210> 18 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 18 

gaaactctct accaggaggg cgaggagttc gccg 

<210> 19 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 19 

cggcgaactc ctcgccctcc tggtagagag tttc 

<210> 20 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 20 

ggtgttccct tgatgtagca ca 



<210> 21 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 21 

acatgtattt gcatggaaaa caactc 26 



<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 22 

agtgcttcgt gcccgatgac 20 



<210> 23 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :primer 

<400> 23 

tgccccttgg tgacatactc g 21 



<210> 24 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 24 

aaaaacgcgt caccagtcac agaaaagcat cttac 35 



<210> 25 

<211> 34 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : primer 



<400> 25 

aaaaacgcgt caaccaagtc attctgagaa tagt 



34 



<210> 26 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 26 

ggattagtat agtgccaatg gssggcga 28 



<210> 27 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 27 

gagggcagaa gtttattccg agctt 25 

<210> 28 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 28 

cgccagggtt ttcccagtca cgac 24 

<210> 29 

<211> 13 

<212> PRT 

<213> Pyrococcus 

<400> 29 

Met Leu Ala Phe Asp lie Glu Thr Leu Tyr His Glu Gly 
15 10 



<210> 30 

<211> 13 

<212> PRT 

<213> Pyrococcus 

<400> 30 



lie Leu Ala Phe Asp lie Glu Thr Leu Tyr His Glu Gly 



10 



<210> 31 

<211> 13 

<212> PRT 

<213> Thermococcus 

<400> 31 

Leu Leu Ala Phe Asp lie Glu Thr Phe Tyr His Glu Gly 
15 10 



<210> 32 

<211> 13 

<212> PRT 

<213> Sulfolobus 

<400> 32 



Arg Val Ala He Asp He Glu Val Tyr Thr Pro Val Lys 
15 10 



<210> 33 

<211> 13 

<212> PRT 

<213> T7 phage 

<400> 33 

Met He Val Ser Asp He Glu Ala Asn Ala Leu Leu Glu 
15 10 



<210> 
<211> 
<212> 
<213> 



34 
13 
PRT 

T4 phage 



<400> 34 



Val Ala Asn Cys Asp He Glu Val Thr Gly Asp Lys Phe 
15 10 



<210> 35 

<211> 7 

<212> PRT 

<213> Pyrococcus 

<400> 35 



or Thermococcus 



Asp He Glu Thr Lys Tyr His 
1 5 



<210> 36 

<211> 7 

<212> PRT 

<213> Pyrococcus or Thermococcus 

<400> 36 

Asp lie Glu Thr Phe Tyr His 
1 5 



<210> 37 

<211> 6 

<212> PRT 

<213> Pyrococcus or Thermococcus 

<400> 37 



Asp He Glu Thr Lys Tyr 
1 5 



